Collection of Usage Information for Language Resources from Academic Articles

Authors

  • Shunsuke Kozawa
  • Hitomi Tohyama
  • Kiyotaka Uchimoto
  • Shigeki Matsubara
Abstract

Language resources (LRs) are becoming indispensable for linguistic research. However, existing LRs are often underused because the variety of ways in which they can be used is not well known, which suggests that their intrinsic value is not well recognized either. Lists of usage information could improve LR searches and lead to more efficient use of LRs. In this research, we therefore collect a list of usage information for each LR from academic articles. To that end, this paper proposes constructing a text corpus annotated with usage information (the UI corpus). Specifically, we automatically extract sentences containing LR names from academic articles, and two annotators then annotate the extracted sentences with usage information in a cascaded manner. We show that the UI corpus contributes to efficient LR searches by combining it with a metadata database of LRs and comparing the number of LRs retrieved with and without the UI corpus.
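
The automatic extraction step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how sentences are split or how LR names are matched, so the naive punctuation-based splitter, the exact-string matching, the function names, and the sample text below are all assumptions made for illustration only.

```python
import re


def split_sentences(text):
    """Naive splitter (assumption): break after '.', '!' or '?' plus whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def extract_lr_sentences(text, lr_names):
    """Return (sentence, matched LR names) pairs for sentences that name an LR.

    Matching is exact and case-sensitive, bounded so that a name is not
    matched inside a longer word. Real LR names have variants and
    abbreviations that this sketch does not handle.
    """
    patterns = {
        name: re.compile(r"(?<!\w)" + re.escape(name) + r"(?!\w)")
        for name in lr_names
    }
    results = []
    for sentence in split_sentences(text):
        found = [name for name, pat in patterns.items() if pat.search(sentence)]
        if found:
            results.append((sentence, found))
    return results


# Hypothetical input: a made-up article snippet and a two-entry LR name list.
article = (
    "We trained the parser on the Penn Treebank. "
    "Results are reported in Section 4. "
    "WordNet was used to expand query terms."
)
hits = extract_lr_sentences(article, ["Penn Treebank", "WordNet"])
for sentence, names in hits:
    print(names, "->", sentence)
```

In the workflow the abstract describes, sentences selected this way would then be passed to the human annotators, who label them with usage information.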


Similar resources

Citation and content analysis of Hormozgan Medical Journal

Introduction: One of the most important branches of scientometrics is citation analysis which offers patterns showing authors’ information searching behavior and authors’ authorship. Citation Analysis provides a pattern of authors’ Information-seeking behavior and also Content Analysis provides a pattern of authorship. This study aimed to analyze the citations and content of p...


From Academic to Journalistic Texts: A Qualitative Analysis of the Evaluative Language of Science

This study examined academic articles and journalistic reports in 5 disciplinary areas to explore how similar contents might attitudinally be realized in two different genres. To this end, 25 research articles and 210 news reports were carefully selected and underwent detailed discourse semantic and grammatical analyses with the purpose of identifying the evaluative linguistic patterns....


Automatic Acquisition of Usage Information for Language Resources

Language resources (LRs) are becoming indispensable for linguistic research. Unfortunately, it is not easy to find out how they are used by searching the web, even though their uses must be described on the Internet or in academic articles. This indicates that the intrinsic value of LRs is not well recognized. In this research, therefore, we extract a list of usage information for each LR to promot...


A Metadiscourse Analysis over Interactive VS Interactional Resources within English Academic Articles in Arts and Humanities

In this article, researchers set out to discover the metadiscourse markers in research articles written by both native and non-native English speakers. To this end, a total number of twenty research articles published by Iranian and native English speakers in highly reputed journals on Arts and Humanities domains were randomly selected from major databases including Science Direct, Noormagz, an...


Textual Metadiscourse Resources in Research Articles*

This study was motivated by three factors, which also contribute to its significance for today’s academic writing. First, research articles are the significant means of communication between the writers all over the world. Second, persuasion and organization are crucial notions in academic writing where the authors have to consider the academic audiences and their needs.  Third, some writers ar...




Publication date: 2010